Picture for Yiming Zhao

Yiming Zhao

Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers

Add code
May 31, 2026
Viaarxiv icon

ACC: Compiling Agent Trajectories for Long-Context Training

Add code
May 21, 2026
Viaarxiv icon

Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech

Add code
Mar 20, 2026
Viaarxiv icon

Internalizing Agency from Reflective Experience

Add code
Mar 17, 2026
Viaarxiv icon

Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces

Add code
Mar 15, 2026
Viaarxiv icon

KoopmanFlow: Spectrally Decoupled Generative Control Policy via Koopman Structural Bias

Add code
Mar 14, 2026
Viaarxiv icon

SVLL: Staged Vision-Language Learning for Physically Grounded Embodied Task Planning

Add code
Mar 12, 2026
Viaarxiv icon

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images

Add code
Dec 23, 2025
Viaarxiv icon

MaP-AVR: A Meta-Action Planner for Agents Leveraging Vision Language Models and Retrieval-Augmented Generation

Add code
Dec 22, 2025
Viaarxiv icon